Tongyao Zhu

Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging

May 08, 2025

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Mar 19, 2025

Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas

Mar 04, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Feb 18, 2025

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Nov 20, 2024

CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization

Oct 16, 2024

Beyond Memorization: The Challenge of Random Memory Access in Language Models

Mar 13, 2024

Translating Natural Language to Planning Goals with Large-Language Models

Feb 10, 2023